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(54) TiUe: C/S-ELEMENT REPORTER CONSTRUCTS AND USES THEREOF 
(57) Abstract 



The present invention provides cis element-reporter constructs for measuring transcription. A series of cis elements are linked to a 
reporter gene to generate cw-acting reporters. Examples of reporter genes are secreted alkaline phosphatase (SEAP), destabilized green 
fluorescent protein (d2EGFP) and luciferase. Also provided are various applications of such constructs. 
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BACKGROUND OF THE INVENTION 

Cross-Reference to Related Applications 

This application claims benefit of priority of pending non- 
10 provisional application U.S. Serial No. 09/206,887, filed December 8, 
1998. 

Field of the Invention 

The present invention relates generally to the field of 
molecular biology of transcription. More specifically, the present 
15 invention relates to cis element reporter constructs for measuring 
transcription. 

Description o f t h e Related Art 

Regulation of the activity of transcription factors is believed 

20 to be one of the most important biochemical controls to signal 
transduction, cell proliferation, differentiation, apoptosis, 
development, and disease. Studying the regulation of an individual 
transcription factor in various specific biological functional system has 
yielded a tremendous amount of information; however, the 

25 information that one can obtain from the current strategies of studying 
an individual factor is limited, despite the global functional status of a 
number of transcription factors. The information is limited because 
cross-talking among the different signal transduction pathways leads to 
activation of multiple transcription factors. Therefore, there is a need 
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to provide a tool to study the interaction and overall effect of various 

transcription factors. 

Green fluorescent protein (GFP) from the jellyfish Aequo rea 
victoria is a reporter molecule for monitoring gene expression and 
protein localization in vivo, in situ and in real time (1-4). GFP 
fluoresces bright green upon mere exposure to UV or blue light-unlike 
other bioluminescent reporters which require additional proteins, 
substrates, or cofactors to emit light. GFP fluorescence is stable, 
species independent and can be monitored noninvasively in living cells. 
GFP fluorescence persists in formaldehyde-fixed cells and is well suited 
for double-labeling experiments with other fluorescent markers, 
including the GFP variant enhanced green fluorescent protein (EGFP, 
GFP mutl ; 5-6). 

EGFP encodes a protein which has a single, red-shifted 
excitation peak and fluoresces about 35 times more intensively than 
wild type GFP when excited at 488 nm (7), due to an increase in its 
extinction coefficient (Em). To ensure maximal mammalian 

expression, the coding region of EGFP contains more than 190 silent 
base mutations which correspond to human codon-usage preferences 
(7). The red-shifted spectrum and increased expression of EGFP make 
it ideal for fluorescence microscopy and fluorescence-activated cell 

sorting (FACS; 5, 8). 

Destabilized EGFP (d2EGFP) contains the PEST domain from 
mouse ornithine decarboxylase (MODC), fused to the C-terminus of 
5 EGFP (9). This domain targets EGFP for rapid turnover effectively 
reducing EGFP's half-life to two hours. The introduction of d2EGFP 
greatly increases the utility of GFP in studying dynamic cellular-events 
in vivo. 

Another reporter molecule, secreted alkaline phosphatase 
0 (SEAP, 10-11), is a secreted form of human placental alkaline 
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phosphatase and has been used as a reporter for the analysis of ex- 
acting DNA sequences and trans-acting factors. Secreted alkaline 
phosphatase is efficiently secreted from cells, and the levels of SEAP 
activity in the culture medium are directly proportional to changes in 
5 intracellular concentrations of SEAP mRNA and protein (10, 12). 
Secreted alkaline phosphatase differs from native placental alkaline 
phosphatase (AP) only in the absence of a membrane-anchoring 
domain. The secreted nature of secreted alkaline phosphatase offers 
many advantages over other transcription reporter molecules. Most 
10 notably, it is not necessary to prepare cell lysates to assay for the 
reporter protein. Because the transfected cells remain intact, the 
kinetics of gene expression can be studied easily using the same 
cultures by sampling the medium repeatedly; therefore, set-up and 
variability for multiple cultures are essentially eliminated. 
15 Another important advantage of secreted alkaline 

phosphatase as a transcription reporter is that background signals due 
to endogenous alkaline phosphatases are nearly absent. Unlike most 
endogenous alkaline phosphatases, secreted alkaline phosphatase is 
extremely heat-stable and resistant to the inhibitor L-homoarginine. 
20 Therefore, endogenous alkaline phosphatase activity can be eliminated 
effectively by preheating the culture medium sample as 65 °C and 
assaying in the presence of L-homoarginine. 

In addition, luciferase is a widely used reporter. It is one of 
most sensitive reporters and like GFP and SEAP, may be fused to a 
25 transcription factor of interest and used to track the activation of the 
transcription factor of interest. 

The prior art is deficient in the lack of effective means of 
detecting and measuring the activities of a number of transcription 
factors. Further, the prior art is deficient in the lack of a construct for 
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measuring transcription. The present invention fulfills this long- 
standing need and desire in the art. 

5 SUMMARY OF THE INVENTION 

The present invention is directed to novel cis-acting 
reporters for measuring transcription. In one embodiment of the 
present invention, there is provided a cis element-reporter construct 
10 comprising a cis element, a reporter gene and a promoter. 
Representative cis elements include nuclear factor-KB, heat responsive 
element, Myc, P 53, nuclear factor of activated T cells, activator protein 
1 (API), serum responsive element, glucocorticoid responsive element 
and cAMP responsive element. Representative reporter genes include 
15 secreted alkaline phosphatase (SEAP), destabilized green fluorescent 
protein (d2EGFP) and luciferase (Luc). Representative promoters 
include thymidine kinase promoter and gonadotropin cc-gene 
promoter. 

In another embodiment of the present invention, there is 
20 provided a method of generating a cis element-reporter construct by 
linking a cis element to a promoter and then fusing the element to a 
reporter gene. 

In yet another embodiment of the present invention, there 
is provided a method of monitoring activation of a transcription factor, 
25 comprising the steps of generating a vector comprising a cis element, a 
reporter gene and a promoter, wherein the cis element is a component 
of a DNA sequence to which said transcription factor binds to ensure 
activity; transfecting a cell line with the vector; and detecting 
expression of the reporter gene, wherein expression of the reporter 
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gene indicates activation of the transcription factor. Representative 

cell lines include CHO, HEK 293, Saos2 and HeLa. 

In still yet another embodiment of the present invention, 

there is provided a method of determining whether a gene is involved 
5 in a signal transduction pathway, comprising the steps of generating a 

vector comprising a cis element, a reporter gene and a promoter, 

wherein the cis element is a component of a DNA sequence which 

affects the expression of the a gene; expressing the vector in a cell line, 

wherein the cell line is associated with the signal transduction pathway; 
10 and detecting the expression of the reporter gene, wherein expression 

of the reporter gene indicates the involvement of the gene in the signal 

transduction pathway. 

Other and further aspects, features, and advantages of the 

present invention will be apparent from the following description of 
15 the presently preferred embodiments of the invention given for the 

purpose of disclosure. 
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BRIEF DESCRIPTION OF THE DRAWINGS 



So that the matter in which the above-recited features, 
advantages and objects of the invention, as well as others which will 
become clear, are attained and can be understood in detail, more 
particular descriptions of the invention briefly summarized above may 

25 be had by reference to certain embodiments thereof which are 
illustrated in the appended drawings. These drawings form a part of 
the specification. It is to be noted, however, that the appended 
drawings illustrate preferred embodiments of the invention and 
therefore are not to be considered limiting in their scope. 

30 Figure 1 shows the restriction map of pNF-KB-SEAP. 
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Figure 2 shows the restriction map of pNF-KB-d2EGFP. 
Figure 3 shows the restriction map of pNF-xB-Luc. 
Figure 4 shows mediated induction of transcription factors 
in HEK 293 cells. Twenty four hours after transfection, cells were 
treated with an inducer for 6 hours. Samples were collected and 
measured for SEAP activity. Figure 4A shows TNF-mediated 
transcriptional induction. Cells were treated with 100 ng/ml TNF; NFkB 
and NFAT signaling pathways exhibit 4-fold and 10-fold induction, 
respectively, when activated by TNF. Figure 4B shows serum-mediated 
induction of transcription factors in HEK 293 cells. Cells were fed with 
10% serum. Multiple pathways were induced, when activated by 
serum. In both experiments, pTK-SEAP was used as the negative 
control. (RLU = relative light units) 



DETAILED DESCRIPTION OF THE INVENTION 

The following terms have the definitions set below. 
As used herein, "cis-element" means a genetic element 
having the ability to affect the activity of a gene on the same DNA 
molecule as that genetic element. 

As used herein, "reporter" means a molecule (usually a 
protein) that is expressed in response to or as a result of a particular 
biological or molecular event. 

As used herein, "cis-acting reporter" means a protein that is 
expressed in a measurable manner in response to a molecular event 
that takes place on the same DNA strand that the DNA which codes for 
the protein is located. 
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As used herein, "transcription factor" means any factor that 
controls genetic transcription. 

As used herein, "signal transduction pathway" means the 
pathway or sequence of molecular events which converts an 
5 extracellular signal into cellular activity. 

The present invention is directed to a series of cis element- 
reporter constructs and uses thereof. 

In one embodiment of the present invention, there is 
provided a cis element-reporter construct comprising a cis element, a 
10 reporter gene and a promoter. Representative cis elements include 
nuclear factor-KB, heat responsive element, Myc, p53, nuclear factor of 
activated T cells, activator protein 1, serum responsive element, 
glucocorticoid responsive element and cAMP responsive element. In 
one preferred embodiment, the nuclear factor-KB has the sequence 
15 shown in SEQ ID No: 1. In one preferred embodiment, the heat 
responsive element has the sequence shown in SEQ ID No: 2. In one 
preferred embodiment, the Myc has the sequence shown in SEQ ID No: 
3. In one preferred embodiment, the p53 has the sequence shown in 
SEQ ID No: 4. In one preferred embodiment, the nuclear factor of 
20 activated T cells has the sequence shown in SEQ ID No: 5. In one 
preferred embodiment, the activator protein 1 has the sequence shown 
in SEQ ID No: 6. In one preferred embodiment, the serum responsive 
element has the sequence shown in SEQ ID No: 7. In one preferred 
embodiment, the glucocorticoid responsive element has the sequence 
25 shown in SEQ ID No: 8. In one preferred embodiment, the cAMP 
responsive element has the sequence shown in SEQ ID No: 9 or SEQ ID 
No: 10. Representative reporter genes include secreted alkaline 
phosphatase (SEAP), destabilized green fluorescent protein (d2EGFP) 
and luciferase (Luc). Representative promoters include thymidine 
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kinase promoter, gonadotropin a-gene promoter, and other known by 
those having ordinary skill in this art. 

In another embodiment of the present invention, there is 
provided a method of generating a cis element-reporter construct by 
linking a cis element to a promoter and then fusing the element to a 
reporter gene. 

In yet another embodiment of the present invention, there 
is provided a method of monitoring activation of a transcription factor, 
comprising the steps of generating a vector comprising a cis element, a 
reporter gene and a promoter, wherein the cis element is a component 
of a DNA sequence to which said transcription factor binds to ensure 
activity; transfecting a cell line with the vector; and detecting 
expression of the reporter gene, wherein expression of the reporter 
gene indicates activation of the transcription factor. Representative 
cell lines include CHO, HEK 293, Saos2 and HeLa. 

In still yet another embodiment of the present invention, 
there is provided a method of determining whether a gene is involved 
in a signal transduction pathway, comprising the steps of generating a 
vector comprising a cis element, a reporter gene and a promoter, 
wherein the cis element is a component of a DNA sequence which 
affects the expression of a test gene; expressing the vector in a cell 
line, wherein the cell line is associated with the signal transduction 
pathway, and detecting the expression of the reporter gene, wherein 
expression of the reporter gene indicates the involvement of the gene 
in the signal transduction pathway. 

The following examples are given for the purpose of 
illustrating various embodiments of the invention and are not meant to 
limit the present invention in any fashion. 
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EXAMELE 1 



(7fc-F.lp.mp.nt. 

The following cis elements were utilized for constructing 

5 cis-acting reporters: 

NF-kB (nuclear factor-xB) having the sequence of GGGAA 
TTTCCGGGAATTTCCGGGAATTTCCGGGAATTTCC (SEQ ED No: 1); 

HRE (heat responsive element) having the sequence of 
TAGAATGTTCTAGATCTAGAACATTCTAGCTAGAATGTTCTAG (SEQ ID No: 

10 2); 

Myc having the sequence of CACGTGCACGTGCACGTGCACG 
TGCACGTGCACGTG (SEQ ID No: 3); 

p53 having the sequence of TCGAGCTTGCCTGGACTTGCCTG 
GGCCAGATCTGTCGACGGAGGTCGAGCTTGCCTGGACTTGCCTGCCCACATCT 
1 5 GTCGACGGAGGTCGAGCTTGCCTGGACTTGCCTGCCCACATCTGTCGACGGAG 
GTCGAGCCTTGCCTGGACTTGCCTGCCA (SEQ ID No: 4); 

NFAT (nuclear factor of activated T cells) having the 
sequence of GGAGGAAAAACTGTTTCATACAGAAGGCGTGGAGGAAAAA 
CTGTTTCATACAGAAGGCGTGGAGGAAAAACTGTTTCATACAGAAGGCGT 

20 (SEQ ID No: 5); 

API (activator protein 1) is an active protein and its activity 
is composed primarily by Jun/Fos heterodimers as well as Jun 
homodimers. The activity of API responds to growth factors, such as 
phorbol ester (TP A), protein kinase C. Apl having the sequence of 

2 5 TGAGTC AGTG AGTC ACTG ACTC ACTG ACTC ATG AGTC AGCTG 

ACTC (SEQ ID No: 6); 

SRE (serum responsive element) having the sequence of 
GATGTCCATATTAGGACATCGATGTCCGAATATGGACATCGATGTCCATATT 

AGGACATC (SEQ ID No: 7); 
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GRE (glucocorticoid responsive element) having the 
sequence of GGTACATTTTGTTCTAGAACAAAATGTACCGGTACATTTTGT 
TCT (SEQ ID No: 8); 

CRE stands for cAMP responsive element. Two CRE elements 
5 were utilized: CRE3 (GCACCAGACAGTGACGTCAGCTGCCAGAT 

CCCATGGCCGTCATACTGTGACGTCTTTCAGACACCCCATTGACGTCAATGGG 
AGAAC: SEQ ID No: 9) and CRE5 (TCGAGCCCATGGCCGTCATACTGTGACG 
TCCCCATGGCCGTCATACTGTGACGTCCCCATGGCCGTCATACTGTGACGTCC 
CCATGGTCGTCATACTGTGACGTCCCCATGGCCGTCATACTGTGACGTC: SEQ 
10 ID No:10). 



EXAMPLE 2 



r/c-Rlftment/SRAP Constructs 

15 A number of cis elements were linked to a secreted alkaline 

phosphatase reporter wherein the secreted alkaline phosphatase wa 
used as a reporter. One advantage of using SEAP as a reporter is that 
SEAP is secreted into the medium and the medium itself can be 
collected for the assay without disrupting the cells. 

20 In a AP1-SEAP construct, the cis element in the construct 

contains six copies of API binding sequence. In a SRE-SEAP construct, 
the construct contains three copies of SRE element. In a CRESEAP 
construct, the construct contains three different cAMP responsive 
elements, responding to cAMP, protein kinase A, forskolin etc. In a 

25 GRE-SEAP construct, the construct contains three copies of 
glucocorticoid responsive element and responds to the glucocorticoid- 
activated glucocorticoid receptor. In a HRE-SEAP construct, the 
construct contains three copies of heat responsive element. It 
responds to a number of types of physiological stress, such as heat. In 

30 an NF-kB-SEAP construct, the construct contains four copies of the NF- 

10 
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KB binding sequence and responds to a number of stimuli, including 
TNF. In a NFAT-SEAP construct, the construct contains three copies of 
NFAT binding sequence and responds to PMA, ionomycin, PKC, etc. In a 
Myc-SEAP construct, the construct contains six copies of myc binding 
sequence and responds to myc activation. In a P53-SEAP construct, the 
construct contains two copies of P 53 binding sequence, reporting P 5 3 
activation. 

EXAMELE-3 
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pHP-vR-SF.AP Construct 

pNF-KB-SEAP (SEQ ID No:ll) is especially designed for 
monitoring the activation of NF-kB signal transduction pathway (13- 
16). The restriction map of P NF-kB-SEAP is shown in Figure 1. 
15 Induction of the pathway enables endogenous NF-kB to bind to the 
kappa (k) enhancer element (KB 4 ) located in the promoter region of 
the vector, thus activating transcription of the reporter gene. 

pNF-KB-SEAP allows expression of the secreted alkaline 
phosphatase reporter gene (17-19). The secreted alkaline phosphatase 
20 reporter coding sequence is followed by the SV40 late polyadenylation 
signal to ensure proper, efficient processing of the secreted alkaline 
phosphatase reporter transcript in eukaryotic cells. A synthetic 
transcription blocker (TB), composed of adjacent polyadenylation and 
transcription pause sites located upstream of the x-enhancer, reduces 
25 background transcription (20). The vector backbone also contains an 
fl origin for single-stranded DNA production, a pUC origin of 
replication, and an ampicillin resistance gene for propagation and 
selection in E.coli. The SEAP vectors incorporate a number of 
additional features that improve the sensitivity of secreted alkaline 
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phosphatase reporter by increasing the efficiency of secreted alkaline 
phosphatase reporter expression or that enhance the utility of the 
vectors. These include: the removal of the SV40 small-t intron, which 
can cause cryptic splicing and reduced expression in some genes 
and/or cell types (21-22); switching from the early to late 
polyadenylation signal of SV40, which typically causes a five-fold 
increase in mRNA levels (23); compact plasmid size; and removal of 
extraneous sequence from the 3' untranslated region of the SEAP 
mRNA. 



EXAMPLE 4 



r/c-F.lpmpnt/F.rrFP Construct 

EGFP was also used as a reporter to analyze transcription 
factors. Destabilized EGFP (d2EGFP) is an autofluorescence protein 
wherein fluorescence emission of this protein does not require any 
additional substrates or cofactors. d2EGFP is an ideal reporter to be 
used in cell-based assays for high throughput drug screening as it 
allows real-time monitoring of activation of transcription factors. The 
detection can be made in multiple points. 

A set of d2EGFP reporters with different cis-elements were 
generated, which are used for monitoring different transcription 
factors. Establishment of stable cell lines that express individual 
reporters expands application of this cis-element reporter in the cell- 
based high throughput drug screening. 

An NF-KB-d2EGFP construct also is used to monitor the 
activation of NF-kB. This construct responds to a number of stimuli, 
including TNF and ILL The APl-d2EGFP construct is used to monitor 
the activation of API, which responds to growth factors. Additionally, 
an SRE-d2EGFP construct is used to monitor the activation of E1K1. This 

1 2 
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construct responds to serum. The GRE-d2EGFP construct was used to 
monitor the activation of glucocorticoid receptors and responds to 
glucocorticoid activation. The CRE-d2EGFP construct was used to 
monitor CREB activation and responds to cAMP and cAMP-mediated 
5 protein kinase A (PKA). An NFAT-d2EGFP construct was used to 
monitor Ca++-mediated NFAT activation which responds to Ca++. The 
HSE-d2EGFP construct is used to monitor HSF activation and responds 
to a stress, such as heat. The P53-d2EGFP construct is used to monitor 
p53 activation, responding to DNA damage. The Myc-d2EGFP construct 

10 is used to monitor Myc activation, which responds to cell growth. 

To construct the cis-acting d2EGFP reporters, the binding 
sequences of cis element was fused to the thymidine kinase (TK) 
promoter and d2EGFP gene. Resulting constructs are able to monitor 
the activation of a number of transcription factors. In CRE-d2EGFP 

1 5 transfected CHO cells, forskolin treatment elevates intracellular cAMP 
resulting in induction of d2EGFP. In NFAT-d2EGFP transfected HEK 29 3 
cells, ionomycin and PMA treatment activates NFAT resulting in 
induction of d2EGFP. In GRE-d2EGFP transfected Saos2 cells, 
dexamethasone treatment activates glucocorticoid receptor (GR) 

20 resulting induction of d2EGFP. 

EXAMPLE 5 

pNF-vR.H9F .nFP Pnnstrnr.t 
25 By linking NF-kB binding sequences and the thymidine 

kinase promoter, d2EGFP gene was able to detect TNF-mediated 
activation of NF-kB. pNF-KB-d2EGFP (SEQ ID No:12) was designed to 
monitor the activation of the NF-kB signal transduction pathway (13- 
16). The restriction map of pNF-KB-d2EGFP is shown in Figure 2. 
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Induction of the pathway enables endogenous NF-kB to bind to the 
kappa (k) enhancer element (KB 4 ) located in the promoter region of 
the vector, thus activating transcription of the reporter gene. 

d2EGFP is a destabilized variant of the enhanced green 
fluorescent protein. Unlike the original EGFP protein which is 
extremely stable, d2EGFP has a half-life of approximately two hours in 
vivo (24). Therefore, d2EGFP provides a more accurate measurement 
of transient activation by NF-kB than unmodified EGFP and is ideal for 
kinetic studies of gene activation. To construct d2EGFP, residues 422- 
461 of mouse ornithine decarboxylase (MODC) were fused to the C- 
terminus of EGFP. This region of the mouse ornithine decarboxylase 
contains a PEST amino acid sequence that targets the protein for 
degradation results in rapid protein turnover. pNF-KB-d2EGFP allows 
for real-time analysis of pNF-icB activation by either fluorescence 

microscopy or flow cytometry. 

The d2EGFP coding sequence is followed by the SV40 late 
polyadenylation signal to ensure proper, efficient processing of the 
d2EGFP transcript in eukaryotic cells. A synthetic transcription blocker 
(TB), composed of adjacent polyadenylation and transcription pause 
sites located upstream of the k enhancer reduces background 
transcription (20). The vector backbone also contains fl origin for 
single-stranded DNA production, a pUC origin of replication, and an 
ampicillin resistance gene for propagation and selection in E.coli. The 
d2EGFP vectors incorporate a number of features that improve the 
sensitivity of d2EGFP by increasing the efficiency of d2EGFP expression 
or that enhance the utility of the vectors. These include: the removal 
of the SV40 small-t intron, which can cause cryptic splicing and 
reduced expression in some genes and/or cell types (21-22); switching 
from the early to late polyadenylation signal of SV40, which typically 
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causes a five-fold increase in mRNA levels (23); compact plasmid size; 
and removal of extraneous sequence from the 3' untranslated region of 
the d2EGFP mRNA. 

5 EXAMPLES 

rPFS-H9FnFP Construct 

pCRE5-d2EGFP (SEQ ID No: 13) was constructed using the 
pTK-SEAP as the backbone. The thymidine kinase promoter was 

10 substituted by 5 copies of CREs (cAMP response elements) fused with 
gonadotropin cc-gene promoter (also called Thyroid-stimulating 
hormone TSHoc subunit). The d2EGFP reporter gene was used instead of 
secreted alkaline phosphatase. This construct was designed to monitor 
the cAMP level changes in vivo. cAMP elevation within cells upon 

15 stimulation should result in the phosphorylation of CREB, which 
consequenly induces the transcription and translation of the reporter 
gene d2EGFP. The destabilized form of EGFP eliminated the 
accumulation of EGFP in cells and better represented the changes of its 
expression level upon stimulation. 

20 The stable cell line was generated by stably transfected 

pCRE5-d2EGFP together with pSV2-neo into CHO-K1 cells. Transfected 
cells were selected under G418 at a 500 \ig/m\ concentration and 
individual clones were isolated and analyzed for forskolin induction. 
Three cell clones were selected which show a time course as well as the 

25 positive response to forskolin. Clone CRE5-dE #3 has the highest 
fluorescence intensity and is the easiest to detect induction, since it 
had five fold increase of fluorescence after induction. Clone CRE5-dE 
#38 has the highest fold of induction (10 fold) and the lowest 
background EGFP expression. However, after induction, Clone CRE5-dE 
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#38 had a lower fluorescent intensity than that of clone #3. Clone #3 8- 
2 is an intermediate of above two clones, having an 8 fold fluorescence 
induction and, after induction, the fluorescence intensity was at the 
similar level of clone #3. These cell clones can be used in cell based 
5 high-throughput screening in search of factors involved in cAMP signal 
transduction pathway. 



EXAMPLE 7 



10 r/r-Rlftme.nt/Luciferasp, Construct 

Luciferase was also used as a reporter to analyze 
transcription factors. Luciferase is one of most sensitive reporters and 
is widely used in the art. Due to its sensitivity and popularity, 
luciferase was selected to link to a cis element to form a cis- 

15 element/luciferase construct for monitoring the activation of 
transcription factors. 

pNF-KB -Luciferase (pNF-KB-Luc, SEQ ID No: 14) contains a 
firefly luciferase gene and is designed for monitoring the activation of 
the NF-kB signal transduction pathway (13-16). The restriction map of 

20 pNF-KB-Luc is shown in Figure 3. Induction of the pathway enables 
endogenous NF-kB to bind to the kappa (k) enhancer element (KB 4 ) 
located in the promoter region of the vector, thus activating 
transcription of the reporter gene. The luciferase coding sequence is 
followed by the SV40 late polyadenylation signal to ensure proper, 

25 efficient processing of the luciferase transcript in eukaryotic cells. A 
synthetic transcription blocker, composed of adjacent polyadenylation 
and transcription pause sites located upstream of the K-enhancer, 
reduces background transcription (20). The vector backbone also 
contains an fl origin for single-stranded DNA production, a pUC origin 
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of replication, and an ampici.lin resistance gene for propagation and 

selection in E.coli. 

The pNF-KB vectors incorporate a number of features that 

imp rove the sensitivity of luciferase by increasing the efficiency of 

5 .uoiferase expression or tha, enhance the utility of the vectors. These 

include: the removal of the SV40 small-, intron, which can cause 

cryptic spiicing and reduced expression in some genes and/or eel 

types (2,-22); switching from the early to late po.yadenylation s.gnal 

of SV40, which typically causes a five-fold increase in mRNA levels 

,0 (23); compact plasmid size; and removal of extraneous sequence from 

the 3' untranslated region of the luciferase mRNA. 

pyaMPI.F.S 

Tnn ^,i r .i„n Fnrtn- in Signal Pathways 

The activity assay of the present invention may be carr.ed 
out in a 12 or 24 well plate. This assay can be applied to a lot of 
studies, such as examining whether a new target gene links to any 
20 signa, pathways. The assay can be used for establishing a funcuona 
status profile of transcription factors for evaluating any 
difference in vivo, such as norma, cells versus differenUated, 
apoptotic, and cancer cells. The assays of the present invention can 
als „ be used to examine and quickly identify biologica. markers for 
25 human diseases. Drug validation is another application of the present 



assay 



30 



Figure 4 demonstrates mediated induction of transcription 
factors in HEK 293 cells. Figure 4A shows tumor necrosis factor (TNF)- 
otediated transcriptiona. induction. Human embryonic kidney (HEK 
293) ceUs were transfec.ed individually with each reporter vector and 
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incubated for 24 hr in serum-free media. After incubation with TNF for 
6 hr, samples were collected and measured for SEAP activity. The 
transcriptional profile reveals that, as expected, TNF activated the NFkB 
and the NFAT signaling pathways, which are known to play pivotal roles 
5 in the transcription of genes involved in the immune response (1-4). 
Other transcription factors and controls were unaffected by TNF 
stimulation. 

Figure 4B shows serum-mediated induction of transcription 
factors HEK293 cells. After transfection, cells were cultured in serum- 

10 free media for 24 hr and incubated with 10% serum for 6 hr. Samples 
were collected and measured for SEAP activity as described above. 
Unlike the TNF-mediated activation of specific pathways (Figure 4A), 
serum activates multiple transcription factors that induce a variety of 
signaling events after stimulation. 

15 The following references were cited herein. 

1. Chalfie, et al., (1994) Science 263: 802-805. 

2. Prasher, et al., (1992) Gene 111: 229-233. 

3. Inouye, et al., (1994) FEBS Letters 341: 277-280. 

4. Wang, et al., (1994) Nature 369: 400-403. 

20 5. Yang, et al., (1996) Nucleic Acid Res. 24(22): 4592-4593. 

6. Cormack, et al., (1996) Gene 173: 33-38. 

7. Hass, et al., (1996) Curr. Biol. 6: 315-324. 

8. Galbraith, et al., (1995) Methods. Cell Biol. 50: 1-12. 

9. Living Colors Destabilized EGFP Vectors (April 1998), 
25 CLONTECHniques XIII(2): 16-17. 

10. Berger, et al., (1998) Gene 66: 1-10. 

11. Kain, et al., (1997) Methods Mol. Biol. 63: 49-60. 

12. Cullen, et al., (1992) Methods Enzymol. 216: 362-368. 

13. Baeuerle, P.A. & Baltimore, D. (1996) Cell 87:13-20 
30 14. Baeuerle, P.A. (1998) Curr. Biol. 8:R19-R2 
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Peltz, G. (1997) Curr. Opin. Biotechnol. 8:467-473 
Israel, A. (1997) Nature 388: 519-521 
Berger, J. et al. (1998) Gene 66:1-10 
Kain, S.R. (1997) Methods Mol. Biol. 63:49-60 
Cullen, B.R. & Malim, M.H. (1992) Methods Enzymol 216:362- 
368 

Eggermont, J. & Proudfoot, N. (1993) EMBO J. 12:2539-2548 
Huang, M.T.F. & Gorman, CM. (1990) Mol. Cell. Biol. 10:1805- 
1810 

Evans, M.J. & Scarpulla, R.C. (1989) Gene 84:135-142 
Carswell, S. & Alwine, J.C. (1989) Mol. Cell. Biol. 9:5248-4258 

Any patents or publications mentioned in this specification 
are indicative of the levels of those skilled in the art to which the 
15 invention pertains. These patents and publications are herein 
incorporated by reference to the same extent as if each individual 
publication was specifically and individually indicated to be 
incorporated by reference. 

One skilled in the art will readily appreciate that the 
20 present invention is well adapted to carry out the objects and obtain 
the ends and advantages mentioned, as well as those inherent therein. 
The present examples along with the methods, procedures, treatments, 
molecules, and specific compounds described herein are presently 
representative of preferred embodiments, are exemplary, and are not 
25 intended as limitations on the scope of the invention. Changes therein 
and other uses will occur to those skilled in the art which are 
encompassed within the spirit of the invention as defined by the scope 
of the claims. 
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1 . A construct for measuring transcription, comprising: 

a cis element, wherein said cis element is selected from the 
group consisting of nuclear factor-KB having the sequence shown in 
SEQ ID No: 1, heat responsive element having the sequence shown in 
SEQ ID No: 2, Myc having the sequence shown in SEQ ID No: 3, p5 3 
having the sequence shown in SEQ ID No: 4, nuclear factor of activated 
T cells having the sequence shown in SEQ ID No: 5, alkaline 
phosphatase 1 having the sequence shown in SEQ ID No: 6, serum 
responsive element having the sequence shown in SEQ ID No: 7, 
glucocorticoid responsive element having the sequence shown in SEQ 
ID No: 8 and cAMP responsive element having the sequence selected 
from the group consisting of SEQ ID No: 9 and SEQ ID No: 10; 

a reporter gene; and 

a promoter. 

2. The construct of claim 1, wherein said reporter gene 
is selected from the group consisting of secreted alkaline phosphatase, 
destabilized green fluorescent protein and luciferase. 

3. The construct of claim 1, wherein said promoter is 
selected from the group consisting of thymidine kinase promoter and 
gonadotropin a-gene promoter. 

4. A vector comprising the construct of claim 1. 



20 
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5 . The vector of claim 4, wherein said vector is pNF-KB- 

SEAP. 



5 

d2EGFP. 



6 . The vector of claim 4, wherein said vector is pNF-xB- 



7 . The vector of claim 4, wherein said vector is pNF-KB- 

10 Luc. 



8. A method of monitoring activation of a transcription 

factor, comprising the steps of: 
15 generating the construct of claim 1, wherein said cis- 

element is a transcription factor; 

transforming a cell line with a vector containing said 

construct; and 

detecting the expression of said reporter gene, wherein 
20 expression of said reporter gene indicates the activation of said 
transcription factor. 

9. A method of monitoring activation of a transcription 

factor, comprising the steps of: 
25 generating the construct of claim 2; 

transforming a cell line with a vector containing said 

construct; and 



2 1 



5 



WO 00/34435 PCT/US99/28451 

detecting the expression of said reporter gene, wherein 

expression of said reporter gene indicates the activation of said 
transcription factor. 



10. The method of claim 9, wherein said cell line is 
selected from the group consisting of CHO, HEK 293 and Saos2. 



10 11. The method of claim 9, wherein said construct is 

selected from the group consisting of pNF-KB-SEAP, pNF-KB-d2EGFP and 
pNF-KB-Luc. 
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SEQUENCE LISTING 



<110> Li, Xianqiang 

Zhao, Xiaoning 
Fang, Yu 
Jiang, Xin 
Duong , Tommy 
Kain, Steve R. 

<120> Cis-Element Reporter Constructs and Uses Thereof 

<130> D6170PCT 

<141> 1999-12-01 

<150> US 09/206,887 

<151> 1998-12-08 

<160> 14 

<210> 1 

<211> 40 

<212> DNA 

<213> artificial sequence 
<220> 

<223> NF-KB element 

<400> 1 

gggaatttcc gggaatttcc gggaatttcc gggaatttcc 40 

<210> 2 

<211> 43 

<212> DNA 

<213> artificial sequence 
<220> 

<223> heat responsive element (HRE) 

<400> 2 

tagaatgttc tagatctaga acattctagc tagaatgttc tag 43 

<210> 3 

<211> 36 

<212> DNA 

<213> artificial sequence 
<220> 

<223> Myc element 
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<400> 3 
cacgtgcacg tgcacgtgca cgtgcacgtg cacgtg 



36 



<210> 4 
<211> 157 
<212> DNA 

<213> artificial sequence 

<220> 

<223> p53 cis element 

<400> 4 

tcgagcttgc ctggacttgc ctgggccaga tctgtcgacg gaggtcgagc 50 
ttgcctggac ttgcctgccc acatctgtcg acggaggtcg agcttgcctg 100 
gacttgcctg cccacatctg tcgacggagg tcgagccttg cctggacttg 150 
cctgcca 157 



<210> 5 
<211> 90 
<212> DNA 

<213> artificial sequence 

<220> 

<223> nuclear factor of activated T cells (NF-AT) 

<400> 5 

ggaggaaaaa ctgtttcata cagaaggcgt ggaggaaaaa ctgtttcata 50 
cagaaggcgt ggaggaaaaa ctgtttcata cagaaggcgt 90 



<210> 6 

<211> 46 

<212> DNA 

<213> artificial sequence 
<220> 

<223> alkaline phosphatase 1 (API) 

<400> 6 

tgagtcagtg agtcactgac tcactgactc atgagtcagc tgactc 46 



<210> 7 
<211> 60 
<212> DNA 
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<213> artificial sequence 

<220> 

<223> serum responsive element (SRE) 

<400> 7 

gatgtccata ttaggacatc gatgtccgaa tatggacatc gatgtccata 50 
ttaggacatc 60 



<210> 8 

<211> 45 

<212> DNA 

<213> artificial sequence 
<220> 

<223> glucocorticoid responsive element (GRE) 

<400> 8 

ggtacatttt gttctagaac aaaatgtacc ggtacatttt gttct 45 

<210> 9 
<211> 87 
<212> DNA 

<213> artificial sequence 

<220> 

<223> cAMP responsive element (CRE3) 

<400> 9 

gcaccagaca gtgacgtcag ctgccagatc ccatggccgt catactgtga 50 
cgtctttcag acaccccatt gacgtcaatg ggagaac 87 

<210> 10 
<211> 130 
<212> DNA 

<213> artificial sequence 

<220> 

<223> cAMP responsive element (CRE5) 

<400> 10 

tcgagcccat ggccgtcata ctgtgacgtc cccatggccg tcatactgtg 50 
acgtccccat ggccgtcata ctgtgacgtc cccatggtcg tcatactgtg 100 
f acgtccccat ggccgtcata ctgtgacgtc 13 0 
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<210> 


11 


<211> 


4846 


<212> 


DNA 


<213> 


artificial sequence 


<220> 




<223> 


synthesized pNF-KB- 


<400> 


11 



50 
100 
150 
200 
250 
300 
350 
400 
450 



atttccggga atttccagat ctgccgcccc gactgcatct gcgtgttcga 
attcgccaat gacaagacgc tgggcggggt ttgtgtcatc atagaactaa 
agacatgcaa atatatttct tccggggaca ccgccagcaa acgcgagcaa 
cgggccacgg ggatgaagca gaagcttcga atcgcgaatt cgcccaccat 
gctgctgctg ctgctgctgc tgggcctgag gctacagctc tccctgggca 
tcatcccagt tgaggaggag aacccggact tctggaaccg cgaggcagcc 
gaggccctgg gtgccgccaa gaagctgcag cctgcacaga cagccgccaa 
gaacctcatc atcttcctgg gcgatgggat gggggtgtct acggtgacag 
ctgccaggat cctaaaaggg cagaagaagg acaaactggg gcctgagata 500 
cccctggcca tggaccgctt cccatatgtg gctctgtcca agacatacaa 550 
tgtagacaaa catgtgccag acagtggagc cacagccacg gcctacctgt 600 
gcggggtcaa gggcaacttc cagaccattg gcttgagtgc agccgcccgc 650 
tttaaccagt gcaacacgac acgcggcaac gaggtcatct ccgtgatgaa 700 
tcgggccaag aaagcaggga agtcagtggg agtggtaacc accacacgag 750 
tgcagcacgc ctcgccagcc ggcacctacg cccacacggt gaaccgcaac 800 
tggtactcgg acgccgacgt gcctgcctcg gcccgccagg aggggtgcca 850 
ggacatcgct acgcagctca tctccaacat ggacattgac gtgatcctag 900 
gtggaggccg aaagtacatg tttcgcatgg gaaccccaga ccctgagtac 950 
ccagatgact acagccaagg tgggaccagg ctggacggga agaatctggt 1000 
gcaggaatgg ctggcgaagc gccagggtgc ccggtatgtg tggaaccgca 1050 
ctgagctcat gcaggcttcc ctggacccgt ctgtgaccca tctcatgggt 1100 
ctctttgagc ctggagacat gaaatacgag atccaccgag actccacact 1150 
ggacccctcc ctgatggaga tgacagaggc tgccctgcgc ctgctgagca 1200 
ggaacccccg cggcttcttc ctcttcgtgg agggtggtcg catcgaccat 1250 
ggtcatcatg aaagcagggc ttaccgggca ctgactgaga cgatcatgtt 1300 
cgacgacgcc attgagaggg cgggccagct caccagcgag gaggacacgc 1350 
tgagcctcgt cactgccgac cactcccacg tcttctcctt cggaggctac 1400 
cccctgcgag ggagctccat cttcgggctg gcccctggca aggcccggga 1450 
caggaaggcc tacacggtcc tcctatacgg aaacggtcca ggctatgtgc 1500 
tcaaggacgg cgcccggccg gatgttaccg agagcgagag cgggagcccc 1550 

SEQ 4/14 
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gagtatcggc agcagtcagc agtgcccctg gacgaagaga cccacgcagg 1600 
cgaggacgtg gcggtgttcg cgcgcggccc gcaggcgcac ctggttcacg 1650 
gcgtgcagga gcagaccttc atagcgcacg tcatggcctt cgccgcctgc 1700 
ctggagccct acaccgcctg cgacctggcg ccccccgccg gcaccaccga 1750 
cgccgcgcac ccgggttact ctagagtcgg ggcggccggc cgcttcgagc 1800 
agacatgata agatacattg atgagtttgg acaaaccaca actagaatgc 1850 
agtgaaaaaa atgctttatt tgtgaaattt gtgatgctat tgctttattt 1900 
gtaaccatta taagctgcaa taaacaagtt aacaacaaca attgcattca 1950 
ttttatgttt caggttcagg gggaggtgtg ggaggttttt taaagcaagt 2000 
aaaacctcta caaatgtggt aaaatcgata aggatccgtc gaccgatgcc 2050 
cttgagagcc ttcaacccag tcagctcctt ccggtgggcg cggggcatga 2100 
ctatcgtcgc cgcacttatg actgtcttct ttatcatgca actcgtagga 2150 
caggtgccgg cagcgctctt ccgcttcctc gctcactgac tcgctgcgct 22 00 
cggtcgttcg gctgcggcga gcggtatcag ctcactcaaa ggcggtaata 22 50 
cggttatcca cagaatcagg ggataacgca ggaaagaaca tgtgagcaaa 23 00 
aggccagcaa aaggccagga accgtaaaaa ggccgcgttg ctggcgtttt 2350 
tccataggct ccgcccccct gacgagcatc acaaaaatcg acgctcaagt 2400 
cagaggtggc gaaacccgac aggactataa agataccagg cgtttccccc 2450 
tggaagctcc ctcgtgcgct ctcctgttcc gaccctgccg cttaccggat 25 00 
acctgtccgc ctttctccct tcgggaagcg tggcgctttc tcatagctca 2550 
cgctgtaggt atctcagttc ggtgtaggtc gttcgctcca agctgggctg 2600 
tgtgcacgaa ccccccgttc agcccgaccg ctgcgcctta tccggtaact 2650 
atcgtcttga gtccaacccg gtaagacacg acttatcgcc actggcagca 2700 
gccactggta acaggattag cagagcgagg tatgtaggcg gtgctacaga 2750 
gttcttgaag tggtggccta actacggcta cactagaagg acagtatttg 2800 
gtatctgcgc tctgctgaag ccagttacct tcggaaaaag agttggtagc 2850 
tcttgatccg gcaaacaaac caccgctggt agcggtggtt tttttgtttg 2900 
caagcagcag attacgcgca gaaaaaaagg atctcaagaa gatcctttga 2950 
tcttttctac ggggtctgac gctcagtgga acgaaaactc acgttaaggg 3000 
attttggtca tgagattatc aaaaaggatc ttcacctaga tccttttaaa 3050 
ttaaaaatga agttttaaat caatctaaag tatatatgag taaacttggt 3100 
ctgacagtta ccaatgctta atcagtgagg cacctatctc agcgatctgt 3150 
ctatttcgtt catccatagt tgcctgactc cccgtcgtgt agataactac 3200 
gatacgggag ggcttaccat ctggccccag tgctgcaatg ataccgcgag 3250 
acccacgctc accggctcca gatttatcag caataaacca gccagccgga 33 00 
agggccgagc gcagaagtgg tcctgcaact ttatccgcct ccatccagtc 3350 
tattaattgt tgccgggaag ctagagtaag tagttcgcca gttaatagtt 3400 
tgcgcaacgt tgttgccatt gctacaggca tcgtggtgtc acgctcgtcg 3450 
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tttggtatgg cttcattcag ctccggttcc caacgatcaa ggcgagttac 3500 

atgatccccc atgttgtgca aaaaagcggt tagctccttc ggtcctccga 3550 

tcgttgtcag aagtaagttg gccgcagtgt tatcactcat ggttatggca 3 600 

gcactgcata attctcttac tgtcatgcca tccgtaagat gcttttctgt 3650 

gactggtgag tactcaacca agtcattctg agaatagtgt atgcggcgac 3700 

cgagttgctc ttgcccggcg tcaatacggg ataataccgc gccacatagc 3750 

agaactttaa aagtgctcat cattggaaaa cgttcttcgg ggcgaaaact 3800 

ctcaaggatc ttaccgctgt tgagatccag ttcgatgtaa cccactcgtg 3850 

cacccaactg atcttcagca tcttttactt tcaccagcgt ttctgggtga 3900 

gcaaaaacag gaaggcaaaa tgccgcaaaa aagggaataa gggcgacacg 3950 

gaaatgttga atactcatac tcttcctttt tcaatattat tgaagcattt 4000 

atcagggtta ttgtctcatg agcggataca tatttgaatg tatttagaaa 4050 

aataaacaaa taggggttcc gcgcacattt ccccgaaaag tgccacctga 4100 

cgcgccctgt agcggcgcat taagcgcggc gggtgtggtg gttacgcgca 4150 

gcgtgaccgc tacacttgcc agcgccctag cgcccgctcc tttcgctttc 42 00 

ttcccttcct ttctcgccac gttcgccggc tttccccgtc aagctctaaa 4250 

tcgggggctc cctttagggt tccgatttag tgctttacgg cacctcgacc 4300 

ccaaaaaact tgattagggt gatggttcac gtagtgggcc atcgccctga 43 50 

tagacggttt ttcgcccttt gacgttggag tccacgttct ttaatagtgg 4400 

actcttgttc caaactggaa caacactcaa ccctatctcg gtctattctt 4450 

ttgatttata agggattttg ccgatttcgg cctattggtt aaaaaatgag 4500 

ctgatttaac aaaaatttaa cgcgaatttt aacaaaatat taacgtttac 4550 

aatttcccat tcgccattca ggctgcgcaa ctgttgggaa gggcgatcgg 4600 

tgcgggcctc ttcgctatta cgccagccca agctaccatg ataagtaagt 4650 

aatattaagg tacgggaggt acttggagcg gccgcaataa aatatcttta 4700 

ttttcattac atctgtgtgt tggttttttg tgtgaatcga tagtactaac 4750 

atacgctctc catcaaaaca aaacgaaaca aaacaaacta gcaaaatagg 4800 

ctgtccccag tgcaagtgca ggtgccagaa catttctcta tcgata 4846 

<210> 12 
<211> 4163 
<212> DNA 

<213> artificial sequence 

<220> 

<223> synthesized pNF-KB-d2EGFP construct 

<400> 12 

ggtaccgagc tcttacgcgt gctagcggga atttccggga atttccggga 50 
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atttccggga atttccagat ctgccgcccc 
attcgccaat gacaagacgc tgggcggggt 
agacatgcaa atatatttct tccggggaca 
cgggccacgg ggatgaagca gaagcttccc 
agggcgagga gctgttcacc ggggtggtgc 
ggcgacgtaa acggccacaa gttcagcgtg 
tgccacctac ggcaagctga ccctgaagtt 
tgcccgtgcc ctggcccacc ctcgtgacca 
tgcttcagcc gctaccccga ccacatgaag 
cgccatgccc gaaggctacg tccaggagcg 
acggcaacta caagacccgc gccgaggtga 
gtgaaccgca tcgagctgaa gggcatcgac 
cc tggggcac aagctggagt acaactacaa 
tggccgacaa gcagaagaac ggcatcaagg 
aacatcgagg acggcagcgt gcagctcgcc 
ccccatcggc gacggccccg tgctgctgcc 
cccagtccgc cctgagcaaa gaccccaacg 
ctgctggagt tcgtgaccgc cgccgggatc 
gtacaagaag cttagccatg gcttcccgcc 
atggcacgct gcccatgtct tgtgcccagg 
cctgcagcct gtgcttctgc taggatcaat 
ggccggccgc ttcgagcaga catgataaga 
aaccacaact agaatgcagt gaaaaaaatg 
atgctattgc tttatttgta accattataa 
aacaacaatt gcattcattt tatgtttcag 
ggttttttaa agcaagtaaa acctctacaa 
atccgtcgac cgatgccctt gagagccttc 
gtgggcgcgg ggcatgacta tcgtcgccgc 
tcatgcaact cgtaggacag gtgccggcag 
cactgactcg ctgcgctcgg tcgttcggct 
actcaaaggc ggtaatacgg ttatccacag 
aagaacatgt gagcaaaagg ccagcaaaag 
cgcgttgctg gcgtttttcc ataggctccg 
aaaatcgacg ctcaagtcag aggtggcgaa 
taccaggcgt ttccccctgg aagctccctc 
cctgccgctt accggatacc tgtccgcctt 
cgctttctca tagctcacgc tgtaggtatc 
cgctccaagc tgggctgtgt gcacgaaccc 



PCT/1JS99/28451 

gactgcatct gcgtgttcga 100 
ttgtgtcatc atagaactaa 150 
ccgccagcaa acgcgagcaa 2 00 
cgaattccgg atggtgagca 250 
ccatcctggt cgagctggac 3 00 
tccggcgagg gcgagggcga 350 
catctgcacc accggcaagc 400 
ccctgaccta cggcgtgcag 450 
cagcacgact tcttcaagtc 500 
caccatcttc ttcaaggacg 550 
agttcgaggg cgacaccctg 600 
ttcaaggagg acggcaacat 650 
cagccacaac gtctatatca 700 
tgaacttcaa gatccgccac 750 
gaccactacc agcagaacac 800 
cgacaaccac tacctgagca 850 
agaagcgcga tcacatggtc 900 
actctcggca tggacgagct 950 
ggaggtggag gagcaggatg 1000 
agagcgggat ggaccgtcac 1050 
gtgtagtcta gagtcggggc 1100 
tacattgatg agtttggaca 1150 
ctttatttgt gaaatttgtg 1200 
gctgcaataa acaagttaac 12 50 
gttcaggggg aggtgtggga 13 00 
atgtggtaaa atcgataagg 1350 
aacccagtca gctccttccg 1400 
acttatgact gtcttcttta 1450 
cgctcttccg cttcctcgct 1500 
gcggcgagcg gtatcagctc 1550 
aatcagggga taacgcagga 1600 
gccaggaacc gtaaaaaggc 1650 
cccccctgac gagcatcaca 1700 
acccgacagg actataaaga 1750 
gtgcgctctc ctgttccgac 1800 
tctcccttcg ggaagcgtgg 1850 
tcagttcggt gtaggtcgtt 1900 
cccgttcagc ccgaccgctg 1950 
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cgccttatcc ggtaactatc gtcttgagtc caacccggta agacacgact 2000 
tatcgccact ggcagcagcc actggtaaca ggattagcag agcgaggtat 2050 
gtaggcggtg ctacagagtt cttgaagtgg tggcctaact acggctacac 2100 
tagaaggaca gtatttggta tctgcgctct gctgaagcca gttaccttcg 2150 
gaaaaagagt tggtagctct tgatccggca aacaaaccac cgctggtagc 2200 
ggtggttttt ttgtttgcaa gcagcagatt acgcgcagaa aaaaaggatc 2250 
tcaagaagat cctttgatct tttctacggg gtctgacgct cagtggaacg 2300 
aaaactcacg ttaagggatt ttggtcatga gattatcaaa aaggatcttc 2350 
acctagatcc ttttaaatta aaaatgaagt tttaaatcaa tctaaagtat 2400 
atatgagtaa acttggtctg acagttacca atgcttaatc agtgaggcac 2450 
ctatctcagc gatctgtcta tttcgttcat ccatagttgc ctgactcccc 2500 
gtcgtgtaga taactacgat acgggagggc ttaccatctg gccccagtgc 2550 
tgcaatgata ccgcgagacc cacgctcacc ggctccagat ttatcagcaa 2600 
taaaccagcc agccggaagg gccgagcgca gaagtggtcc tgcaacttta 2650 
tccgcctcca tccagtctat taattgttgc cgggaagcta gagtaagtag 2700 
ttcgccagtt aatagtttgc gcaacgttgt tgccattgct acaggcatcg 2750 
tggtgtcacg ctcgtcgttt ggtatggctt cattcagctc cggttcccaa 2800 
cgatcaaggc gagttacatg atcccccatg ttgtgcaaaa aagcggttag 2850 
ctccttcggt cctccgatcg ttgtcagaag taagttggcc gcagtgttat 2900 
cactcatggt tatggcagca ctgcataatt ctcttactgt catgccatcc 2950 
gtaagatgct tttctgtgac tggtgagtac tcaaccaagt cattctgaga 3000 
atagtgtatg cggcgaccga gttgctcttg cccggcgtca atacgggata 3050 
ataccgcgcc acatagcaga actttaaaag tgctcatcat tggaaaacgt 3100 
tcttcggggc gaaaactctc aaggatctta ccgctgttga gatccagttc 3150 
gatgtaaccc actcgtgcac ccaactgatc ttcagcatct tttactttca 3200 
ccagcgtttc tgggtgagca aaaacaggaa ggcaaaatgc cgcaaaaaag 3250 
ggaataaggg cgacacggaa atgttgaata ctcatactct tcctttttca 3300 
atattattga agcatttatc agggttattg tctcatgagc ggatacatat 3350 
ttgaatgtat ttagaaaaat aaacaaatag gggttccgcg cacatttccc 3400 
cgaaaagtgc cacctgacgc gccctgtagc ggcgcattaa gcgcggcggg 3450 
tgtggtggtt acgcgcagcg tgaccgctac acttgccagc gccctagcgc 3500 
ccgctccttt cgctttcttc ccttcctttc tcgccacgtt cgccggcttt 3550 
ccccgtcaag ctctaaatcg ggggctccct ttagggttcc gatttagtgc 3600 
tttacggcac ctcgacccca aaaaacttga ttagggtgat ggttcacgta 3 650 
gtgggccatc gccctgatag acggtttttc gccctttgac gttggagtcc 3700 
acgttcttta atagtggact cttgttccaa actggaacaa cactcaaccc 3750 
tatctcggtc tattcttttg atttataagg gattttgccg atttcggcct 3800 
attggttaaa aaatgagctg atttaacaaa aatttaacgc gaattttaac 3850 



SEQ 8/14 



aaaatattaa cgtttacaat ttcccattcg 
ttgggaaggg cgatcggtgc gggcctcttc 
taccatgata agtaagtaat attaaggtac 
gcaataaaat atctttattt tcattacatc 
gaatcgatag tactaacata cgctctccat 
caaactagca aaataggctg tccccagtgc 
ttctctatcg ata 



PCTAJS99/2845 1 

ccattcaggc tgcgcaactg 3900 
gctattacgc cagcccaagc 3950 
gggaggtact tggagcggcc 4000 
tgtgtgttgg ttttttgtgt 4050 
caaaacaaaa cgaaacaaaa 4100 
aagtgcaggt gccagaacat 4150 

4163 



<210> 13 

<211> 4300 

<212> DNA 

<213> artificial sequence 
<220> 



<223> 


synthesized pCRE5- 


-d2EGFP 






<400> 


13 










ggtaccgagc 


tcttacgcgt 


gctagcggga 


atttccggga 


atttccggga 


50 


atttccggga 


atttccagat 


cctcgagccc 


atggccgtca 


tactgtgacg 


100 


tccccatggc 


cgtcatactg 


tgacgtcccc 


atggccgtca 


tactgtgacg 


150 


tccccatggt 


cgtcatactg 


tgacgtcccc 


atggccgtca 


tactgtgacg 


200 


tcccgcggtc 


gactaccctt 


caatcattgg 


atggaatttc 


ctgttgatcc 


250 


cagggcttag 


atgcaggtgg 


aaacactctg 


ctggtataaa 


agcaggtgag 


300 


gacttcatta 


actgcagtta 


ctgagaactc 


ataagacgaa 


gctaaaatcc 


350 


ctcttcggaa 


gcttccccga 


attccggatg 


gtgagcaagg 


gcgaggagct 


400 


gttcaccggg 


gtggtgcc.ca 


tcctggtcga 


gctggacggc 


gacgtaaacg 


450 


gccacaagtt 


cagcgtgtcc 


ggcgagggcg 


agggcgatgc 


cacctacggc 


500 


aagctgaccc 


tgaagttcat 


ctgcaccacc 


ggcaagctgc 


ccgtgccctg 


550 


gcccaccctc 


gtgaccaccc 


tgacctacgg 


cgtgcagtgc 


ttcagccgct 


600 


accccgacca 


catgaagcag 


cacgacttct 


tcaagtccgc 


catgcccgaa 


650 


ggctacgtcc 


aggagcgcac 


catcttcttc 


aaggacgacg 


gcaactacaa 


700 


gacccgcgcc 


gaggtgaagt 


tcgagggcga 


caccctggtg 


aaccgcatcg 


750 


agctgaaggg 


catcgacttc 


aaggaggacg 


gcaacatcct 


ggggcacaag 


800 


ctggagtaca 


actacaacag 


ccacaacgtc 


tatatcatgg 


ccgacaagca 


850 


gaagaacggc 


atcaaggtga 


acttcaagat 


ccgccacaac 


atcgaggacg 


900 


gcagcgtgca 


gctcgccgac 


cactaccagc 


agaacacccc 


catcggcgac 


950 


ggccccgtgc 


tgctgcccga 


caaccactac 


ctgagcaccc 


agtccgccct 


1000 


gagcaaagac 


cccaacgaga 


agcgcgatca 


catggtcctg 


ctggagttcg 


1050 


tgaccgccgc 


cgggatcact 


ctcggcatgg 


acgagctgta 


caagaagctt 


1100 


agccatggct 


tcccgccgga 


ggtggaggag 


caggatgatg 


gcacgctgcc 


1150 
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catgtcttgt gcccaggaga gcgggatgga ccgtcaccct gcagcctgtg 1200 
cttctgctag gatcaatgtg tagtctagag tcggggcggc cggccgcttc 1250 
gagcagacat gataagatac attgatgagt ttggacaaac cacaactaga 1300 
atgcagtgaa aaaaatgctt tatttgtgaa atttgtgatg ctattgcttt 1350 
atttgtaacc attataagct gcaataaaca agttaacaac aacaattgca 1400 
ttcattttat gtttcaggtt cagggggagg tgtgggaggt tttttaaagc 1450 
aagtaaaacc tctacaaatg tggtaaaatc gataaggatc cgtcgaccga 1500 
tgcccttgag agccttcaac ccagtcagct ccttccggtg ggcgcggggc 1550 
atgactatcg tcgccgcact tatgactgtc ttctttatca tgcaactcgt 1600 
aggacaggtg ccggcagcgc tcttccgctt cctcgctcac tgactcgctg 1650 
cgctcggtcg ttcggctgcg gcgagcggta tcagctcact caaaggcggt 1700 
aatacggtta tccacagaat caggggataa cgcaggaaag aacatgtgag 1750 
caaaaggcca gcaaaaggcc aggaaccgta aaaaggccgc gttgctggcg 1800 
tttttccata ggctccgccc ccctgacgag catcacaaaa atcgacgctc 1850 
aagtcagagg tggcgaaacc cgacaggact ataaagatac caggcgtttc 1900 
cccctggaag ctccctcgtg cgctctcctg ttccgaccct gccgcttacc 1950 
ggatacctgt ccgcctttct cccttcggga agcgtggcgc tttctcatag 2000 
ctcacgctgt aggtatctca gttcggtgta ggtcgttcgc tccaagctgg 2050 
gctgtgtgca cgaacccccc gttcagcccg accgctgcgc cttatccggt 2100 
aactatcgtc ttgagtccaa cccggtaaga cacgacttat cgccactggc 2150 
agcagccact ggtaacagga ttagcagagc gaggtatgta ggcggtgcta 2200 
cagagttctt gaagtggtgg cctaactacg gctacactag aaggacagta 2250 
tttggtatct gcgctctgct gaagccagtt accttcggaa aaagagttgg 23 00 
tagctcttga tccggcaaac aaaccaccgc tggtagcggt ggtttttttg 23 50 
tttgcaagca gcagattacg cgcagaaaaa aaggatctca agaagatcct 2400 
ttgatctttt ctacggggtc tgacgctcag tggaacgaaa actcacgtta 2450 
agggattttg gtcatgagat tatcaaaaag gatcttcacc tagatccttt 2500 
taaattaaaa atgaagtttt aaatcaatct aaagtatata tgagtaaact 2550 
tggtctgaca gttaccaatg cttaatcagt gaggcaccta tctcagcgat 2600 
ctgtctattt cgttcatcca tagttgcctg actccccgtc gtgtagataa 2650 
ctacgatacg ggagggctta ccatctggcc ccagtgctgc aatgataccg 2700 
cgagacccac gctcaccggc tccagattta tcagcaataa accagccagc 2750 
cggaagggcc gagcgcagaa gtggtcctgc aactttatcc gcctccatcc 2800 
agtctattaa ttgttgccgg gaagctagag taagtagttc gccagttaat 2850 
agtttgcgca acgttgttgc cattgctaca ggcatcgtgg tgtcacgctc 2900 
gtcgtttggt atggcttcat tcagctccgg ttcccaacga tcaaggcgag 2950 
ttacatgatc ccccatgttg tgcaaaaaag cggttagctc cttcggtcct 3000 
ccgatcgttg tcagaagtaa gttggccgca gtgttatcac tcatggttat 3050 
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ggcagcactg cataattctc ttactgtcat gccatccgta agatgctttt 3100 
ctgtgactgg tgagtactca accaagtcat tctgagaata gtgtatgcgg 3150 
cgaccgagtt gctcttgccc ggcgtcaata cgggataata ccgcgccaca 3200 
tagcagaact ttaaaagtgc tcatcattgg aaaacgttct tcggggcgaa 3250 
aactctcaag gatcttaccg ctgttgagat ccagttcgat gtaacccact 3300 
cgtgcaccca actgatcttc agcatctttt actttcacca gcgtttctgg 3350 
gtgagcaaaa acaggaaggc aaaatgccgc aaaaaaggga ataagggcga 3400 
cacggaaatg ttgaatactc atactcttcc tttttcaata ttattgaagc 3450 
atttatcagg gttattgtct catgagcgga tacatatttg aatgtattta 3500 
gaaaaataaa caaatagggg ttccgcgcac atttccccga aaagtgccac 3550 
ctgacgcgcc ctgtagcggc gcattaagcg cggcgggtgt ggtggttacg 3600 
cgcagcgtga ccgctacact tgccagcgcc ctagcgcccg ctcctttcgc 3650 
tttcttccct tcctttctcg ccacgttcgc cggctttccc cgtcaagctc 3700 
taaatcgggg gctcccttta gggttccgat ttagtgcttt acggcacctc 3750 
gaccccaaaa aacttgatta gggtgatggt tcacgtagtg ggccatcgcc 3800 
ctgatagacg gtttttcgcc ctttgacgtt ggagtccacg ttctttaata 3850 
gtggactctt gttccaaact ggaacaacac tcaaccctat ctcggtctat 3900 
tcttttgatt tataagggat tttgccgatt tcggcctatt ggttaaaaaa 3950 
tgagctgatt taacaaaaat ttaacgcgaa ttttaacaaa atattaacgt 4000 
ttacaatttc ccattcgcca ttcaggctgc gcaactgttg ggaagggcga 4050 
tcggtgcggg cctcttcgct attacgccag cccaagctac catgataagt 4100 
aagtaatatt aaggtacggg aggtacttgg agcggccgca ataaaatatc 4150 
tttattttca ttacatctgt gtgttggttt tttgtgtgaa tcgatagtac 4200 
taacatacgc tctccatcaa aacaaaacga aacaaaacaa actagcaaaa 4250 
taggctgtcc ccagtgcaag tgcaggtgcc agaacatttc tctatcgata 43 00 



<210> 14 
<211> 4987 
<212> DNA 

<213> artificial sequence 

<220> 

<223> synthesized pNF-KB-Luc 

<400> 14 

ggtaccgagc tcttacgcgt gctagcggga atttccggga atttccggga 50 
atttccggga atttccagat ctgccgcccc gactgcatct gcgtgttcga 100 
attcgccaat gacaagacgc tgggcggggt ttgtgtcatc atagaactaa 150 
agacatgcaa atatatttct tccggggaca ccgccagcaa acgcgagcaa 200 
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cgggccacgg ggatgaagca gaagcttggc attccggtac tgttggtaaa 250 
gccaccatgg aagacgccaa aaacataaag aaaggcccgg cgccattcta 3 00 
tccgctggaa gatggaaccg ctggagagca actgcataag gctatgaaga 3 50 
gatacgccct ggttcctgga acaattgctt ttacagatgc acatatcgag 400 
gtggacatca cttacgctga gtacttcgaa atgtccgttc ggttggcaga 450 
agctatgaaa cgatatgggc tgaatacaaa tcacagaatc gtcgtatgca 500 
gtgaaaactc tcttcaattc tttatgccgg tgttgggcgc gttatttatc 550 
ggagttgcag ttgcgcccgc gaacgacatt tataatgaac gtgaattgct 600 
caacagtatg ggcatttcgc agcctaccgt ggtgttcgtt tccaaaaagg 650 
ggttgcaaaa aattttgaac gtgcaaaaaa agctcccaat catccaaaaa 700 
attattatca tggattctaa aacggattac cagggatttc agtcgatgta 750 
cacgttcgtc acatctcatc tacctcccgg ttttaatgaa tacgattttg 800 
tgccagagtc cttcgatagg gacaagacaa ttgcactgat catgaactcc 850 
tctggatcta ctggtctgcc taaaggtgtc gctctgcctc atagaactgc 900 
ctgcgtgaga ttctcgcatg ccagagatcc tatttttggc aatcaaatca 950 
ttccggatac tgcgatttta agtgttgttc cattccatca cggttttgga 1000 
atgtttacta cactcggata tttgatatgt ggatttcgag tcgtcttaat 1050 
gtatagattt gaagaagagc tgtttctgag gagccttcag gattacaaga 1100 
ttcaaagtgc gctgctggtg ccaaccctat tctccttctt cgccaaaage 1150 
actctgattg acaaatacga tttatctaat ttacacgaaa ttgcttctgg 1200 
tggcgctccc ctctctaagg aagtcgggga agcggttgcc aagaggttcc 1250 
atctgccagg tatcaggcaa ggatatgggc tcactgagac tacatcagct 1300 
attctgatta cacccgaggg ggatgataaa ccgggcgcgg tcggtaaagt 1350 
tgttccattt tttgaagcga aggttgtgga tctggatacc gggaaaacgc 1400 
tgggcgttaa tcaaagaggc gaactgtgtg tgagaggtcc tatgattatg 1450 
tccggttatg taaacaatcc ggaagcgacc aacgccttga ttgacaagga 1500 
tggatggcta cattctggag acatagctta ctgggacgaa gacgaacact 1550 
tcttcatcgt tgaccgcctg aagtctctga ttaagtacaa aggctatcag 1600 
gtggctcccg ctgaattgga atccatcttg ctccaacacc ccaacatctt 1650 
cgacgcaggt gtcgcaggtc ttcccgacga tgacgccggt gaacttcccg 1700 
ccgccgttgt tgttttggag cacggaaaga cgatgacgga aaaagagatc 1750 
gtggattacg tcgccagtca agtaacaacc gcgaaaaagt tgcgcggagg 1800 
agttgtgttt gtggacgaag taccgaaagg tcttaccgga aaactcgacg 1850 
caagaaaaat cagagagatc ctcataaagg ccaagaaggg cggaaagatc 1900 
gccgtgtaat tctagagtcg gggcggccgg ccgcttcgag cagacatgat 1950 
aagatacatt gatgagtttg gacaaaccac aactagaatg cagtgaaaaa 2000 
aatgctttat ttgtgaaatt tgtgatgcta ttgctttatt tgtaaccatt 2050 
ataagctgca ataaacaagt taacaacaac aattgcattc attttatgtt 2100 
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tcaggttcag 
acaaatgtgg 
cttcaaccca 
ccgcacttat 
gcagcgctct 
ggctgcggcg 
acagaatcag 
aaaggccagg 
tccgcccccc 
cgaaacccga 
cctcgtgcgc 
cctttctccc 
tatctcagtt 
accccccgtt 
agtccaaccc 
aacaggatta 
gtggtggcct 
ctctgctgaa 
ggcaaacaaa 
gattacgcgc 
cggggtctga 
atgagattat 
aagttttaaa 
accaatgctt 
tcatccatag 
gggcttacca 
caccggctcc 
cgcagaagtg 
ttgccgggaa 
ttgttgccat 
gcttcattca 
catgttgtgc 
gaagtaagtt 
aattctctta 
gtactcaacc 
cttgcccggc 
aaagtgctca 
cttaccgctg 



ggggaggtgt 
taaaatcgat 
gtcagctcct 
gactgtcttc 
tccgcttcct 
agcggtatca 
gggataacgc 
aaccgtaaaa 
tgacgagcat 
caggactata 
tctcctgttc 
ttcgggaagc 
cggtgtaggt 
cagcccgacc 
ggtaagacac 
gcagagcgag 
aactacggct 
gccagttacc 
ccaccgctgg 
agaaaaaaag 
cgctcagtgg 
caaaaaggat 
tcaatctaaa 
aatcagtgag 
ttgcctgact 
tctggcccca 
agatttatca 
gtcctgcaac 
gctagagtaa 
tgctacaggc 
gctccggttc 
aaaaaagcgg 
ggccgcagtg 
ctgtcatgcc 
aagtcattct 
gtcaatacgg 
tcattggaaa 
ttgagatcca 



gggaggtttt 
aaggatccgt 
tccggtgggc 
tttatcatgc 
cgctcactga 
gctcactcaa 
aggaaagaac 
aggccgcgtt 
cacaaaaatc 
aagataccag 
cgaccctgcc 
gtggcgcttt 
cgttcgctcc 
gctgcgcctt 
gacttatcgc 
gtatgtaggc 
acactagaag 
ttcggaaaaa 
tagcggtggt 
gatctcaaga 
aacgaaaact 
cttcacctag 
gtatatatga 
gcacctatct 
ccccgtcgtg 
gtgctgcaat 
gcaataaacc 
tttatccgcc 
gtagttcgcc 
atcgtggtgt 
ccaacgatca 
ttagctcctt 
ttatcactca 
atccgtaaga 
gagaatagtg 
gataataccg 
acgttcttcg 
gttcgatgta 



ttaaagcaag 
cgaccgatgc 
gcggggcatg 
aactcgtagg 
ctcgctgcgc 
aggcggtaat 
atgtgagcaa 
gctggcgttt 
gacgctcaag 
gcgtttcccc 
gcttaccgga 
ctcatagctc 
aagctgggct 
atccggtaac 
cactggcagc 
ggtgctacag 
gacagtattt 
gagttggtag 
ttttttgttt 
agatcctttg 
cacgttaagg 
atccttttaa 
gtaaacttgg 
cagcgatctg 
tagataacta 
gataccgcga 
agccagccgg 
tccatccagt 
agttaatagt 
cacgctcgtc 
aggcgagtta 
cggtcctccg 
tggttatggc 
tgcttttctg 
tatgcggcga 
cgccacatag 
gggcgaaaac 
acccactcgt 
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taaaacc tc t 2150 
ccttgagagc 2200 
actatcgtcg 2250 
acaggtgccg 2300 
tcggtcgttc 2350 
acggttatcc 2400 
aaggccagca 2450 
ttccataggc 2500 
tcagaggtgg 2550 
c tggaagc t c 2 600 
tacctgtccg 2650 
acgctgtagg 2700 
gtgtgcacga 2750 
tatcgtcttg 2800 
agccactggt 2850 
agttcttgaa 2900 
ggtatctgcg 2950 
ctcttgatcc 3000 
gcaagcagca 3050 
atcttttcta 3100 
gattttggtc 3150 
attaaaaatg 3200 
tctgacagtt 3250 
tctatttcgt 3300 
cgatacggga 3 350 
gacccacgct 3 400 
aagggccgag 3450 
ctattaattg 3500 
ttgcgcaacg 3550 
gtttggtatg 3 600 
catgatcccc 3 650 
atcgttgtca 3700 
agcactgcat 3750 
tgactggtga 3800 
ccgagttgct 3850 
cagaacttta 3900 
tctcaaggat 3950 
gcacccaact 4000 
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gatcttcagc atcttttact ttcaccagcg tttctgggtg agcaaaaaca 4050 
ggaaggcaaa atgccgcaaa aaagggaata agggcgacac ggaaatgttg 4100 
aatactcata ctcttccttt ttcaatatta ttgaagcatt tatcagggtt 4150 
attgtctcat gagcggatac atatttgaat gtatttagaa aaataaacaa 4200 
ataggggttc cgcgcacatt tccccgaaaa gtgccacctg acgcgccctg 4250 
tagcggcgca ttaagcgcgg cgggtgtggt ggttacgcgc agcgtgaccg 4300 
ctacacttgc cagcgcccta gcgcccgctc ctttcgcttt cttcccttcc 4350 
tttctcgcca cgttcgccgg ctttccccgt caagctctaa atcgggggct 4400 
ccctttaggg ttccgattta gtgctttacg gcacctcgac cccaaaaaac 4450 
ttgattaggg tgatggttca cgtagtgggc catcgccctg atagacggtt 4500 
tttcgccctt tgacgttgga gtccacgttc tttaatagtg gactcttgtt 4550 
ccaaactgga acaacactca accctatctc ggtctattct tttgatttat 4600 
aagggatttt gccgatttcg gcctattggt taaaaaatga gctgatttaa 4650 
caaaaattta acgcgaattt taacaaaata ttaacgttta caatttccca 47 00 
ttcgccattc aggctgcgca actgttggga agggcgatcg gtgcgggcct 4750 
cttcgctatt acgccagccc aagctaccat gataagtaag taatattaag 4800 
gtacgggagg tacttggagc ggccgcaata aaatatcttt attttcatta 4850 
catctgtgtg ttggtttttt gtgtgaatcg atagtactaa catacgctct 4900 
ccatcaaaac aaaacgaaac aaaacaaact agcaaaatag gctgtcccca 4950 
gtgcaagtgc aggtgccaga acatttctct atcgata 4987 
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